Go back to the [[AI Glossary]]
#rl
In reinforcement learning, a policy that always chooses the action with the highest expected return.